Estimation of mental lexicon size with word familiarity database
نویسندگان
چکیده
A familiarity database was developed for about 80,000 Japanese words of which familiarity scores were rated by 32 Japanese adults using a 7-point scale in auditory, visual, and audio-visual modalities. Auditory, visual, and audio-visual stimulus words were selected from the database according to their word familiarity for size estimation of the mental lexicon. Sixty Japanese adults participated in a two-alternative forcedchoice task (Know–Don't know) for the stimulus words. The size of the mental lexicon was estimated as the number of words of which familiarity is above a particular word corresponding to 50% point on the fitted logistic curve to “know”-response probability of the stimulus words. The estimated size was about 68,000 for auditory words, and about 66,000 both for visual and audio-visual words when homophones and homographs were included. The results suggest that very small difference in the mental lexicon size among modalities.
منابع مشابه
Sonority Contours in Speech Recognition
for their invaluable input into this paper. All errors are my own. The sonority scale that ranks phonemes according to relative " loudness " has long played a significant role in the fields of Phonology and Historical Linguistics, yet it is conspicuously absent from the speech recognition literature. In this preliminary study using the Hoosier Mental Lexicon, it was found that approximately hal...
متن کاملBuilding a Lexical Database for an Interactive Joke-Generator
As part of a project to construct an interactive program which will encourage children to play with language by building jokes, we have developed a large lexical database, closely based on WordNet. As well as the standard WordNet information about part of speech, synonymy, hyponymy, etc, we have added phonetic representations and symbolic links allowing attachment of pictures. All information i...
متن کاملSubjective Familiarity of Words: Analysis of the Hoosier Mental Lexicon
We conducted a statistical analysis of several subsets of words from the Hoosier Mental Lexicon in order to examine some factors underlying the subjective familiarity ratings collected by Nusbaum, Pisoni, and Davis (1984). In this analysis, we grouped words into High-FAM (average familiarity rating greater than 6 on a 7-point scale), Mid-FAM (between 4.5 and 3.5), or Low-FAM (less than 2) sets,...
متن کاملPerception of Wordlikeness: Effects of Segment Probability and Length on Subjective Ratings and Processing of Nonword Sound Patterns1
A stochastic phonological grammar based on positional constituent probabilities of words in the mental lexicon was used to generate nonword sound patterns. The subjective wordlikeness of these patterns was evaluated by naïve listeners using a 7-point rating scale. Subjective wordlikeness ratings were lower for nonwords containing low-probability constituents and were also lower for nonwords wit...
متن کاملAffix Productivity and Base Productivity
Morphological productivity is generally seen as affix-driven, with a given affix selecting base words satisfying a range of formal and semantic selectional restrictions. Plag (1999), however, has shown that there are base-driven selectional restrictions, and Hay (2000) and Hay & Baayen (2002) have shown that the frequency relation between derived and base word is correlated with affix productiv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998